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Abstract 

We study the distribution of descendants of a known personality, or of anybody else, as 
it propagates along generations from father or mother through any of their children. We ask 
for the ratio of the descendants to the total population and construct a model for the route 
of Distribution from Ancestors to Descendants (DAD). The population ratio r„ is found to 
be given by the recursive equation r„+i (2 — r„)r„ , that provides the transition from 
the n— th to the (n + l)th generation, ro = I/ATq and Nq is the total relevant population 
at the first generation. The number of generations it takes to make half the population 
descendants is logAro/log2 and additional ~ 4 generations make everyone a descendent 
(=the full descendant spreading time). These results are independent of the population 
growth factor even if it changes along generations. As a running example we consider the 
offspring of King David. Assuming a population between Nq = 10^ and 5 • 10^ of Israelites 
at King David's time 1000 BC), it took 24 to 26 generations (about 600-650 years, when 
taking 25 years for a generation) to make every Israelite a King David descendent. The 
conclusion is that practically every Israelite living today (and in fact already at 350-400 BC), 
and probably also many others beyond them, are descendants of King David. We note that 
this work doesn't deal with any genetical aspect. We also didn't take into account here any 
geo-social-demographic factor. Nevertheless, along tens of generations, about 120 from King 
David's time till today, the DAD route is likely to govern the distribution in communities 
that are not very isolated. 



1. Introduction 



The well known Galton- Watson (GW) process [H [2] investigates the extinction of surnames 
which propagate from father to son. We consider here a model which unlike the GW model 
depends on both parents, namely the offspring of a known personality which propagates 
through father or mother to their children and we ask for the ratio of the descendants to the 
total population. As an example we will consider the offspring of King David. 

King David lived about 3,000 years ago. We assume that a generation (from birth until 
marriage and children) is 25 years, and each married couple has 2g children, where g is the 
growth factor per generation. Let Nn denote the number of Israelites at the ra— th generation 
and Nn = Nq- ^" where n = 0, 1, 2, ■ ■ ■ , 120(= 3000/25). Let D„ denote the number of 
descendants (male and female) of King David at the n— th generation and C„ = — 
denote the non-descendants. We start at the first generation with Dq = 1, the dynasty 
founder. Our problem is to estimate the ratio r„ = Dn/Nn after n generations. This will 
show, in particular, that practically all Israelites today are descendants. It is possible that a 
family disappears after a few generations (discussed in Section 6), but we assume throughout 
the paper that it doesn't happen. In our example with what we know about King David 
and his son Solomon, we do not have to worry about that. However if we are not sure about 
that we can say, as we discuss later, that either all Israelites are his descendant or none. 
Therefore if there is one descendant then all are descendants. 

It will be shown that for a population of No it took log N^/ log 2 generations to make half 
the population descendants of King David. Additional four generations made all of them 
his descendants. That is the DAD full spreading time. The transitions region between low 
to high spreading ratio is very quick, a few generations. It is not only King David; the 
same relation exists regarding anyone else of his era (or any other early era) whose family 
survived in the first few generations (discussed in Section 6), including for example less 
admirable characters in the Bible like Nabal... Assuming a population between A^^o = 10^ 
and A'o = 5 - 10^ Israelites at King David's time (~ 1000 BC) [3J, it took 24 to 26 generations 
(600-650 years, when taking 25 years for a generation) to ensure that every Israelite was his 
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descendant. That means that every Israehte hving at 400 BC, the beginning of the era of 
the Second Temple in Jerusalem, was already a descendant of King David. 

An interesting feature of the DAD route is that the descendant population ratio and 
the spreading time depend on A^o but not on the population growth factor g even if g is 
generation dependent. 



2. The rule of passing from to D^+i 

Two presentations of the same result are given. 

The First Presentation: Let be the total number of males and females at the n— th 
generation of a certain community, and among them (Davidian) descendants of the 
dynasty we follow. The number of males and females are assumed equal. Further assume 
that we have A„/2 cards with all the female names, one name per card, and similarly 
in another box Nn/2 cards for all the males. Consider now a "match maker" picking up 
randomly and independently one card from each box, and combining them to a single card 
(marriage) with the two names on it. At this stage add to each of the two names information 
whether he (she) is or isn't a D-descendant. Throw away all cards where both names are 
non-descendants. The probability that a descendant married a descendant is Dn/Nn and for 
a descendant marrying a non-descendant it is {N^ — Dn)/Nn- 

The conditional expectation E{Dn+i\Dn) (the number of descendants at the {n + l)th 
generation conditioned on the number of descendants at the n—th generation) follows 
from the fact that for = 1 a married couple who are both descendants will have 2 de- 
scendants. In this case the number of descendants will not change. On the other had if 
a descendant married a non-descendant the one descendant in the n—th generation will 
generate two descendants for the (n -|- l)th generation. 
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Therefore, 



E{Dn+i\D„) = g 



the probabihty that 
a descendant married 
a descendant 



■Dn + 2g- 



Nr,. 



the probabihty that 
a descendant married 
a non-descendant 

gDn- 



■Dr 



(2..V. 



Note that Dn and Nn include males and females, and we assume throughout the paper that 
their numbers are equal. We also comment that this statistical model can be viewed as a 
random walk process, as discussed below in section 3. 

We can simplify the analysis by a simple renormalization procedure that shows right 
away that the DAD route is independent of the growth factor g. We normalize by setting: 



Dr 



Dr 



Cn 



Cn 



Nn-Dr 



(2..2) 



and then with Nn = Nq g"' we obtain from Eq. (2..1 ) 



E{Dr,+l\Dn) 



Dn 
Nn 



Dr,.. 



(2..3) 



The growth factor g is eliminated and therefore A^o is the only relevant quantity. Not only 
that, but the free of g property holds for the general case where g depends on n and then 
gn is the growth factor from the generation n to (n + 1). To realize that, we replace in the 
former procedure: 



g ^ gn , g"- ^ {go- gi- g2:.- gn- 



i=0 



(2-4) 



Then Nn = Nq ■ YYi=o 9i- Usually > 1, but at least we need for the first few generations 
gn > 1/2 to avoid extinction of Dn (discussed in Section 6), and for later generations gn to 
be bounded away from zero. With the normalization: 

n—l n—l 

Dn = Dn/l[gi, Cn = Cn/X{g:. (2..5) 

i=0 1=0 



Eq. (2..1) with g ^ gn transforms to Eq. (2. .3) which is free of all g^ 
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In section 3 we view this statistical process as a simple random walk model. We will 
show that E{DD ^ E^{Dn), and therefore the recursive Eq. (2. .3) can be written as: 



+1J 



2 - 



No 



EDr 



(2-6) 



The Second Presentation: We split the population into males and females, A^^" = = 
^Nrr , 0^^ = 01 = \Dn , = C/ = \Cn whcrc "m" and "f" stand respectively for male 
and female for the Dn (Dvidians) and the rest Cn = Nq — Dn ■ 

We continue to use the normalized population numbers C„ and A^o eliminating g 
from the equations since it doesn't affect the distribution. 

Then the normalized population ratios are: 

Nl No 

_ _ ^ 

N^ ~ M ~ No 



Dn 

N„ 



Cn No - Dn 



Nn. 



No 



(2-7) 



Now we sort the couple types and count them with their probabilities. (One can envision 
it again by the two card boxes described above or by a Roulette wheel procedure: The D^ 
and are randomly marked in the cycling wheel and the and are randomly marked 
in the stationary platform.) 

The couple types and their probabilities are: 



(1) 


jjm _ 

n 


n 




_ No 2 
2 n 




(2) 


Di - 

n 


n 




- Tn) = 


2 " 


(3) 


r^m 

~ 


n 




= f(l 


- Tn) 


(4) 


_ 






- Tn) = 


f(l 



;i-rn) 



(2. 



The first three types give for each couple two D-children, and the fourth one gives zero 
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D-children, giving altogether for the next generation: 

Dn+i = No{2 - r„)r„ = 



2-^] D 



which is the same equation as Eq. (2. .3). 



Upper bound estimate: 



We would have liked to solve (2.. 3) recursively for EDn up to n = A^. This however cannot 
be done since: 

E{Dl) = {ED^f + E{Dn - EDnf . 



and hence (Jensen's inequality) 



EDl > (EDr, 



The same holds for conditional expectations. Then by Eq. (2. .3): 

E\Dn) 



E{D^+i) < 2ED^ 
Therefore, we can solve recursively the equation: 

E Dn+l = 2 EDn 



Na 



{EJh 
No 



but the solution will be an upper bound to the solution of (2. .3) for EDn- 



(2..9) 



(2..10) 



(2..1i: 



In the next section we will show that {EDnY is a very good approximation to E{Dl 



and consequently Eq. (2. .6) will be a very good approximation for Nq >> 1. 



3. The random walk model for E[{Dn+i — Dn)\Dn 



We wish to evaluate the standard deviation of our statistical process at the nth generation 
that leads to the descendant population of the (n+ 1) generation. It is needed for evaluating 
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Eq. (2..9), and for that we apply a random walk model [21 S]. 



Consider the unordered collection of the pairs of cards described in section 2, but without 
those cards where both couple members are non-descendants. It will yield D„ pairs. Next 
arrange these Z)„ pairs randomly and independently as a sequence running from 1 to Dn- Let 
h 1 < < Dn denote this sequence. Card i can have on it a pair of Davidians or one Davidian 
and one non-Davidian. Set A(i) = 2 in case that a Davidian married a non=Davidian and 
= 1 if Davidian married a Davidian. The sequence = 1,2, constitutes 

a random walk process of Dn moves with a probability (Dn/Nn) to make 1 step (x 1 D 
descendants), and a probability (1 — Dn/Nn) to make 2 steps (x2 non-D descendants). 
The total average distance of the walk gives of next generation Davidian descendants Dn+i, 
given by Eq. (2. .3). A similar procedure can be attributed for Dn+i — Dn (successive years 



difference) obtained from Eq. (2..3): 



E[{Dn+l~Dn)\Dn)] 



2^ 



2iVo 



Dn. 



(3..1) 



Here the process is mapped to the standard random walk model where each move is of ±1 
(one forward or one backward) step. The total number of moves is Dn, each is either a 
unit step to the right with a probability p = {1 — Dn/INo) or a unit step to the left with a 
probability q = Dn/2NQ. 



The standard deviation of (-Dn+i 
by laS]: 



Dn) of this random walk process is known to be given 



^1/2 



iDn+l-Dn)-E[iDn+l-Dn)\Dn]Y\Dn\ = 



'D„ 1 



Dn\ f Dn 



2NoJ \2No 



(3..2) 
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but the left hand side of this equation is equal to: 



^1/2 



Dn+1 — 



1 2 



(3..3) 



which is the standard deviation of -Dn+i- 
Therefore: 



El/2< 




Dn+l — E{Dn+l\Dn) 









where c G (1/8, 1/4] is a number of order 1. 



2 — ^ \ D 

^ No ' 




No 



(3..4) 



Then from Eq. (2.. 9) we have: 

E{Dl^,\Dn) = E\Dn+i\D„) (l 



c 

iVo 



(3..5) 



valid for any n. Therefore E'^Dn ~ ED'^^ and we will have for the recursive equation (2..3): 

ED^ 



E{Dr. 



Then the descendant ratio r„ 



SDn _ EDn 



2 - 



EDr,. 



(3..6) 



(and ro = ) will be given by: 



A'n ~ A^o ^ ^ ~ A^o 

r„+i ^ (2 - r„)r„ (3. .7) 

The last two equation are key results from which we derive our main conclusions. 

It should be noted that the accumulated error in the recursive process along n (in our 
cases n = 1,2, ...,50), is negligible since the relative error we found for a step in n is very 

small, of the order of l/N^^"^ (for any n), and Nq ~ 10'^ — 10^. 
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4. Examples for the recursive equation solution 



We give below examples with plots of the descendant population ratio r„ ^ ^^i^"'^ = ^^m"^ 



along the generation number, given by the exact solution of the recursive equation (3..7): 
r„+i ^ (2 - r„)r„. 

The plots in Fig. 1 give the r„ dependance on n for various values of Nq (corresponding 
to various tq = 1/Nq). For a population of 10^, we get for the DAD full spreading time 
(the number of generations for the whole population to becomes descendants) about 20 (or 
20 X 25 = 500 years). For each additional factor of 10 in A^^o we need l/logj^g^ = 3.22 more 
generations. Thus even for a population of Nq = 10^ it takes only (9/log|o2) + 4 = 33.9 
generations (850 years) to the DAD full spreading time. 



0.8- 




Figure 1: Descendant population ratio r„ vs. n, obtained from the recursive equation - 
Eq. (3. .7), for various starting population numbers: Nq = I/vq = 10^,10^,10^ and 10^, 



(corresponding to plots from left to right, respectively). 
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5. Approximating the recursive equation with a 
differential equation 



We derive from the recursive equation (3.. 6) an approximated differential equation. It will 



have the advantage that is possess an explicit solution, although the fitting is not exact. 



We have from Eq. (3. .6): 



ED. 



n+l 



EDr 



(5-1) 



Replacing EDn with a continuous and differentiable function ^{n), we obtain the equation: 



dip{n) 
dn 



1-^1 



(5..2) 



Hence 



dn 



dif{n) 



(5..3) 



Set r = r G (0, 1) and tq = l/No , then 



dr 



r(l-r) Jq 



dn . 



(5..4) 



Hence 



n = In 



r/ro 



l-r)/(l-ro) 



In 



1 — r 



(5..5) 



Solving for r yields 



r{n) 



e" + iVo - 1 



(5..6) 



This solution is only a rough estimate for the recursive equation - Eq. (3..7), and deviates 



from its exact solution especially for low n including the transition region to r ^ 1. By 
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rescaling n n\n2, we obtain a modified equation: 



r(n) 



2" 



2" + A^o 



(5-7) 



that describes the initial descendant evolution more accurately. However, it increases the 



width of the transition region. Eqs. (5.. 6) and (5.. 7) provide a rule of thumb approximation 
that r(n) = 1/2 is obtained at 2" ^ A^^q. 



Fig. 2 shows plots of the solutions - Eqs. (5. .6) and (5. .7) of the two differential equations 



which are compared to the exact solution given by the recursive equation - Eq. (3..7). 




Figure 2: Descendant population ratio r„ obtained from: (a) the recursive equation (3.. 7) 
and the approximations (b) Eq. (5. .6), and (c) Eq. (5. .7) 
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6. Dynasty Extinction 



Our model uses a growth factor that can vary from generation to generation, but takes a fixed 
average number of children of m (= 2g) per family for the whole population. The overall 
DAD model that we presented gives an accurate description of the descendant distribution, 
but as we said, it needs attention to the first few generations. Then, when the dynasty starts 
to build up {Dn is a small number) it has fragility features just as the Galton- Watson (GW) 
process [U [2]. The GW process was presented to find the stability and the extinction of 
surnames (or families) that is dependent on the average number of sons (or children) per 
family. We already mentioned that we need m > 1 {g > 1/2), just as in the GW process, 
to eliminate the possibility for the D extinction. However even for larger g there is a certain 
nonzero probability that a dynasty disappears. We need for that the probabilities for the 
various number of children per two parents that includes a finite probability for having no 
children. The zero children probability causes the extinction. We give here a brief analysis 
that basically follows the GW process. 

For a Poissonic distribution the probability for the number of children k per family is pk = 
m^e~^ /k\. For example, for m = 2 {g = 1): = 0.135, pi = 0.27, p2 = 0.27, andps = 0.18. 
It is easy to realize that most of the extinction probability comes from a descendant chain 
of successive families along the generation line n, each with one child, terminated by a zero 
children family. The probabilit'y for that is the sum of the geometric series ^„PiPo=0-185. 
For m = 3 the sum is 0.058. We will see below that these numbers are very close to the results 
obtained by the GW calculation. In the GW process for Poissonic offspring distribution, the 
extinction probability can be obtained from the probability generating function [2]: 

= e"^(^'"-i) , (6..1) 

and the survival probabihty = 1 — s„ is given by: 

s„+i = l-e-'"^". (6..2) 

For m < 1 {g < 1/2) this recursion levels off to zero (i.e. family extinction). For m > 1 



it levels off to a nonzero value given by the nontrival fixed point of Eq. (6. .2): one of 



the two roots of s = 1 — e The trivial fixed point is always Sa = 0, and the second 
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one Sb, that depends on m, gives the survival probabihty. [When starting the recursion 
at So = 1/Nq it gives for this process the evolution along n of the descendant population 
ratio that asymptotically reaches s^.] For m = 2 {g = 1) children per family, the survival 
ratio is Sb = 0.797 (extinction ratio of 0.203). For m = 3 {g = 1.5), the survival ratio is 
0.94 (extinction ratio of 0.06) and for m = 4 (^f = 2) it is 0.99 = 99% (extinction ratio of 

0. 01 = 1%). The value of m can vary along the time and from place to place; the long term 
global population growth factor, however, is not much higher than g = 1. For g G (1, 1.2), 

1. e. m G (2,2.4), the global family survival ratio is Sb G (0.8,0.88) . 

The conclusion is that a certain fraction (l — Sb) of the population at the n = generation 
will eventually have no descendants. The extinction that can occur in the first few generations 
is related to the GW process. We note what is discussed in the next section, that the 
extinction part doesn't affect the basic DAD process that is governed by the average g for 
those (the most) who survive. Nevertheless, it means about being a descendant of King David 
(or of anybody else) that: either all Israelites are his descendant or there isn't even a single 
descendant; or: if there is for sure one descendant, then the whole population are descendants. 
We said that for a moderate population growth the extinction part is (10 — 20)%, meaning 
that there are no descendants for that fraction of the ancestors population. Nevertheless, 
as we said in the introduction, it is very likely that King David's dynasty survived the first 
few generations and then the DAD process ensured that they fully spread to the whole 
population. 



7. Forward vs. backward approaches; 

Descendants vs. ancestors distributions 



Our model was constructed by going from an ancestor to his descendants and thus look at 
the descendant distribution. Then it is possible to follow the path down from generation to 
generation, but the number of children per couple can vary and have some average value of 
2g. The Galton- Watson (GW) process [ijS] is an example to such analysis. We note the basic 
difference between the GW process and ours. The DAD process depends on the question to 
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whom a Davidian is married. A non-Davidian mate contributes m (= 2g) children per two 
parents, while an intra tribe marriage (Davidian to Davidian) gives only half of it, m/2 = g 
children. In the GW like process this question is irrelevant. Its focus is on the distribution of 
the number of children per family that affect the descendant statistics, but it doesn't include 
the interplay between intra and inter tribe marriage. Therefore the GW process is relevant 
in the DAD analysis in the first generations only to understand the fragility and extinction. 
At that stage the DAD process is anyhow similar to the GW one since the probability for 
a non-Davidian mate is almost 100%. This is the reason why we could use in Section 6 the 
GW analysis for evaluating the survival ratio. 

It is also possible to go along the opposite direction from a descendant back in time to 
his ancestors. Then the route follows the ancestors distribution rather than the descendants. 
The backward way is more common because of the simplicity to track genealogical family 
trees of more recent generations, and since then in the analysis the number of parents is a fixed 
value of 2. Nevertheless the different backward trajectories that can pass through the same 
ancestors have probabilistic aspects. This viewpoint alludes on another aspect, the number 
of paths that connect a descendant at the n—th generation to a specific ancestor. This issue 
depends on the growth factor and the degrees of inter versus intra community mobility. It 
is more likely that the connection path number is significantly higher within communities 
with a common social-geographic history. Derrida et al. P, ^ took the backward direction 
analysis and obtained much understanding on various sides of the process. We mention here 
only one of their results on the population ratio of ancestors at a backward generations given 
in Refs. [51 E]. The result there is related to the GW process giving for the non-ancestors 
population ratio at a backward generation i the recursive equation: Sj+i = e"*^'''"^). For the 
ancestor ratio Si = 1 — Sj, it becomes Sj+i = 1 — e~"^^^ (dependent on m or g), which is 



equivalent to Eq. (6.. 2) of the GW process, and so it differs from our approach. 
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8. Conclusions 



We have presented a model for the distribution of descendants along generations, the DAD 



route. The descendant population and the ratio are given by the recursive equations (3.. 6) 



and (3..7). The descendant ratio, given for a few examples in figure 1, is shown to reach 
1 in a relatively few generations. For an initial population of A^o the DAD full spreading 
time is ~ log A'o/ log 2 + 4, that gives about 20 generations (500 years) for A"o = 10^. Every 
additional factor of 10 in Nq adds l/log^^p^ = 3.32 generations to the DAD full spreading 
time. The basic DAD route behavior, in particular the descendant population ratio does not 
depend on the population growth factor g, but only on the initial population Nq. 

We have not included here any genetic or geo-social-demographic aspects. It is clear that 
DAD will not spread into and out of very isolated groups. Nevertheless, we saw how quick 
the spreading process is. For a small group of 10'^ or 10^ it takes about 14 and 17 generations 
(350 and 425 years) to reach a full descendant spreading ratio. For all Israelites at King's 
David time it took 24-26 generations (600-650 years) to make every Israelite his descendent. 
That therefore happened already at 400 BC, the beginning of the era of the Second Temple in 
Jerusalem. Even for the whole world population at King's David time (1000 BC), estimated 
as 5 X 10'', it is but 29.5 generations (740 years). Segregation of local communities can slow 
down the process, but only in a limited way for relatively short time. It would be sufficient 
that one descendant migrates to another community, say with a similar population number, 
to make the whole population descendants in a few generations. Therefore more globally one 
might say that even if mankind started with many Adams and Eves, it took relatively very 
short time to consider the whole founder group at any early Common Forefather - 

Super DAD - of all of us. 

For the future, the DAD route means that, assuming a reasonable population mobility, 
each of us on earth today {Nq =~ 5 x 10^) (beyond the low extinction percentage discussed in 
Section 6, and assuming that no catastrophic event happens, will be an ancestor of everyone 
in the world - on Earth and beyond?) in ~ log(5 x 10^)/ log 2 + 4 36 generations (~ 
900 years) from now. Can this picture lend a philosophical meaning to what is said about a 
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common forefather of mankind? We saw that all of us have common ancestors and eventually 
we will be ourselves the ancestors of everyone in the future in a relatively short period of 
time. They all were our Fathers and Mothers and they all will be our Sons Daughters. . . 

We finally note that after we first deposited the paper in ArXiv, our attention was drawn 
to the work of Derrida et al. on the same subject [HE]- We beheve that our paper adds new 
insight and results treating the process along the generation line rather than the backward 
way. We have added a short discussion on the difference between the approaches in Sec. 7. 
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